翻訳と辞書 |
Apache Flink : ウィキペディア英語版 | Apache Flink
Apache Flink, like Hadoop and Spark, is a community-driven open source framework for distributed Big Data Analytics. The core of Apache Flink is a distributed streaming dataflow engine written in Java and Scala.〔(【引用サイトリンク】title=Apache Flink: Scalable Batch and Stream Data Processing )〕〔(【引用サイトリンク】title=apache/flink )〕 It aims to bridge the gap between MapReduce-like systems and shared-nothing parallel data base systems. Therefore, Flink executes arbitrary dataflow programs in a data-parallel and pipelined manner.〔Alexander Alexandrov, Rico Bergmann, Stephan Ewen, Johann-Christoph Freytag, Fabian Hueske, Arvid Heise, Odej Kao, Marcus Leich, Ulf Leser, Volker Markl, Felix Naumann, Mathias Peters, Astrid Rheinländer, Matthias J. Sax, Sebastian Schelter, Mareike Höger, Kostas Tzoumas, and Daniel Warneke. 2014. ''The Stratosphere platform for big data analytics''. The VLDB Journal 23, 6 (December 2014), 939-964. (DOI )〕 Flink's pipelined runtime system enables the execution of bulk/batch and stream processing programs.〔(【引用サイトリンク】title=On Apache Flink. Interview with Volker Markl. )〕 Furthermore, Flink's runtime supports the execution of iterative algorithms natively.〔Stephan Ewen, Kostas Tzoumas, Moritz Kaufmann, and Volker Markl. 2012. ''Spinning fast iterative data flows''. Proc. VLDB Endow. 5, 11 (July 2012), 1268-1279. (DOI )〕 Flink programs can be written in Java or Scala and are automatically compiled and optimized〔Fabian Hueske, Mathias Peters, Matthias J. Sax, Astrid Rheinländer, Rico Bergmann, Aljoscha Krettek, and Kostas Tzoumas. 2012. ''Opening the black boxes in data flow optimization''. Proc. VLDB Endow. 5, 11 (July 2012), 1256-1267. (DOI )〕 into dataflow programs that are executed in a cluster or cloud environment.〔Daniel Warneke and Odej Kao. 2009. ''Nephele: efficient parallel data processing in the cloud''. In Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS '09). ACM, New York, NY, USA, , Article 8 , 10 pages. (DOI )〕 Flink does not provide its own data storage system, input data must be stored in a distributed storage system like HDFS or HBase. For data stream processing, Flink consumes data from (reliable) message queues like Kafka. ==Development== Apache Flink is developed under the Apache License 2.0〔(【引用サイトリンク】title=ASF Git Repos - flink.git/blob - LICENSE )〕 by the Apache Flink Community within the Apache Software Foundation. The project is driven by the Berlin based start-up company (dataArtisans ), 20 committers, are more then (130 contributors ).
抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Apache Flink」の詳細全文を読む
スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース |
Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.
|
|